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GenCore version 6.3 
Copyright (c) 1993 - 2009 Bioccelerat ion Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



May 31, 2009, 21:47:12 ; Search time 916 Seconds 
(without alignments) 

101538.085 Million cell updates/sec 

US-10-573-229A-1 

920 

1 tctgtagaggggaatggctg acccccaaagaaaccttcta 920 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 14594359 seqs, 50548307366 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



GenEmbl : * 
gb_env : * 
gb_pat : * 
gb_phg : * 
gb_pln : * 
gb_pri : * 
gb_rod : * 
gb_sts : * 
gb_syn : * 
gb_una : * 
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10 


gb_ 


_vr 1 : * 


11 


gb_ 


_vrt : * 


12 


gb_ 


_inv : * 


13 


gb_ 


_mam : * 


14 


gb_ 


_bct : * 


15 


gb_ 


_htgl : * 


16 


gb_ 


_htg2 : * 


17 


gb_ 


_htg3 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


920 


100 


0 


920 


2 


CS074575 


CS074575 Sequence 




2 


920 


100 


0 


920 


2 


CS430220 


CS430220 Sequence 




3 


920 


100 


0 


920 


2 


DJ044325 


DJ044325 Identific 


c 


4 


422 . 2 


45 


9 


109238 


5 


AL133538 


AL133538 Human DNA 




5 


306 


33 


3 


308 


5 


Z79429 


Z79429 H. sapiens f 


c 


6 


275 


29 


9 


140243 


15 


AC067760 


AC067760 Homo sapi 


c 


7 


275 


29 


9 


147659 


15 


AC015496 


AC015496 Homo sapi 


c 


8 


275 


29 


9 


172212 


5 


AL 13 610 0 


AL136100 Human DNA 




9 


226 . 8 


24 


7 


186029 


16 


AC169795 


AC169795 Macaca mu 


c 


10 


178.2 


19 


4 


390 


2 


CS074841 


CS074841 Sequence 


c 


11 


178.2 


19 


4 


390 


2 


DJ044524 


DJ044524 Identific 


c 


12 


176.6 


19 


2 


390 


2 


CS430486 


CS430486 Sequence 




13 


122 . 6 


13 


3 


561 


2 


AR6 12609 


AR612609 Sequence 




14 


122.6 


13 


3 


561 


2 


AR799123 


AR799123 Sequence 




15 


122.6 


13 


3 


561 


2 


DD052947 


DD052947 SINGLE CO 




16 


121.2 


13 


2 


541 


2 


AR6 12608 


AR612608 Sequence 




17 


121.2 


13 


2 


541 


2 


AR799122 


AR799122 Sequence 




18 


121.2 


13 


2 


541 


2 


DD052946 


DD052946 SINGLE CO 


c 


19 


120 


13 


0 


124409 


5 


AC069360 


AC069360 Homo sapi 




20 


120 


13 


0 


165031 


5 


AC023850 


AC023850 Homo sapi 


c 


21 


114.2 


12 


4 


769 


7 


BV595649 


BV595649 S215P6066 


c 


22 


112.6 


12 


2 


92029 


17 


AL357492 


AL357492 Homo sapi 




23 


112.6 


12 


2 


111182 


5 


AL359771 


AL359 7 71 Human DNA 


c 


24 


111 . 8 


12 


2 


171675 


17 


CU694921 


CU694921 Sus scrof 




25 


109.6 


11 


9 


180602 


5 


AC010873 


AC010873 Homo sapi 




26 


109.6 


11 


9 


186244 


17 


CU467509 


CU467509 Sus scrof 


c 


27 


109.6 


11 


9 


197898 


17 


CT826373 


CT826373 Sus scrof 




28 


108 


11 


7 


142859 


5 


AC008663 


AC008663 Homo sapi 


c 


29 


108 


11 


7 


201170 


15 


AC022259 


AC022259 Homo sapi 


c 


30 


105.4 


11 


5 


241493 


17 


AC213558 


AC213558 Equus cab 




31 


105 


11 


4 


117916 


17 


AC225094 


AC225094 Loxodonta 




32 


105 


11 


4 


123795 


17 


AC218085 


AC218085 Loxodonta 


c 


33 


105 


11 


4 


198249 


16 


AC165117 


AC165117 Bos tauru 
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34 


104. 


. 8 


11. 


. 4 


737 


2 


EA735231 


EA735231 Sequence 




35 


104. 


.8 


11. 


.4 


797 


2 


BD419182 


BD419182 50 Human 




36 


104. 


.8 


11. 


.4 


797 


2 


EA731889 


EA731889 Sequence 


c 


37 


104. 


.8 


11. 


.4 


137000 


2 


AR542499 


AR542499 Sequence 


c 


38 


104. 


.8 


11, 


.4 


145966 


5 


AC006451 


AC006451 Homo sapi 


c 


39 


104. 


.8 


11. 


.4 


178050 


5 


AC142324 


AC142324 Pan trogl 


c 


40 


103. 


.4 


11. 


.2 


176644 


5 


AP002982 


AP002982 Homo sapi 


c 


41 


103 . 


.4 


11. 


.2 


201190 


15 


AC022256 


AC022256 Homo sapi 




42 


103. 


.2 


11 . 


.2 


183899 


16 


AC151854 


AC151854 Papio ham 




43 


103. 


.2 


11 . 


.2 


206939 


16 


AC174979 


AC174979 Bos tauru 


c 


44 


103 . 


.2 


11 , 


.2 


208220 


17 


AC213940 


AC213940 Equus cab 


c 


45 


102. 


.8 


11, 


.2 


163782 


15 


AC067991 


AC067991 Homo sapi 



ALIGNMENTS 



CS074575 920 bp DNA 

Sequence 1 from Patent WO2005030250 . 
CS074575 

CS074575.1 GI:63091182 



PAT 05-MAY-20 05 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 

Tuereci,0., Sahin,U., Helf tenbein, G . and Schlueter,V. 
Identification of tumour-associated cell surface antigens for 
diagnosis and therapy 

Patent: WO 2005030250-A 1 07-APR-2005; 
Ganymed Pharmaceuticals AG (DE) 

Location/ Qualifiers 

1. .920 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 



Query Match 100.0%; Score 920; DB 2; Length 920; 

Best Local Similarity 100.0%; Pred. No. 7.4e-275; 

Matches 920; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 
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6 1 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6 1 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 

181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 4 0 

2 41 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 41 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 3 0 0 

3 01 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 01 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 

361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 54 0 

5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 6 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 6 0 0 

6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 66 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 66 0 

661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

7 81 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 81 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 4 0 
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8 41 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 9 0 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
8 41 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 9 0 0 

901 ACCCCCAAAGAAACCTTCTA 92 0 

I I I I I I I I I I I I I I I I I I I I 
901 ACCCCCAAAGAAACCTTCTA 92 0 



CS430220 920 bp DNA 

Sequence 1 from Patent WO2006100089 . 
CS430220 

CS430220.1 GI:116291601 



PAT 19-OCT-2006 



Euteleostomi; 
Haplorrhini ; 



Usener,D. and 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebra 
Mammalia; Eutheria; Euarchontoglires ; Primates; 
Catarrhini; Hominidae; Homo. 
EFERENCE 1 

AUTHORS Sahin,U., Tuereci,0., Koslowski , M . , Helf tenbein, G . , 
Schlueter, V. 

TITLE Identification of surface-associated antigens for tumor diagnosis 

and therapy 

JOURNAL Patent: WO 2006100089-A 1 28-SEP-2006; 
Ganymed Pharmaceuticals AG (DE) 
Location/ Qualifiers 
1. .920 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref ="taxon : 96 06" 



Query Match 100.0%; Score 920; DB 2; Length 920; 

Best Local Similarity 100.0%; Pred. No. 7.4e-275; 

Matches 920; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

6 1 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 1 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 



121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 

181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 4 0 

2 41 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 41 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 3 0 0 

3 01 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 01 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 

361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 54 0 

5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 60 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 6 0 0 

6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 66 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 66 0 

661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

7 81 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 81 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 4 0 

8 41 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 9 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

8 41 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 9 0 0 

901 ACCCCCAAAGAAACCTTCTA 92 0 
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I I I I I I I I I I I I I I I I I I I I 
Db 901 ACCCCCAAAGAAACCTTCTA 92 0 



DJ044325 920 bp DNA linear PAT 26-OCT-2007 

Identification of Tumour-Associated Cell Surface Antigens for 
Diagnosis and Therapy. 
DJ044325 

DJ044325.1 GI:158737401 
JP 2007506417-A/l . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 920) 

Heruf utembain, G . , Shuryuta,V., Sahin,U. and Tureci,0. 

Identification of Tumour-Associated Cell Surface Antigens for 
Diagnosis and Therapy 

Patent: JP 2007506417-A 1 22-MAR-2007; 
Ganymed Pharmaceuticals AG 
OS Homo sapiens 

JP 2007506417-A/l 

22- MAR-2007 

23- SEP-2004 JP 2006527354 
26-SEP-2003 DE 10344799.7 

geert heruf utembain, volker shuryuta, ugur sahin, oezlem tureci 



Key Location/Qualifiers. 
Location /Qualifiers 
1. .920 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 



Query Match 100.0%; Score 920; DB 2; Length 920; 

Best Local Similarity 100.0%; Pred. No. 7.4e-275; 

Matches 920; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 



6 1 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6 1 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 
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Qy 121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 

181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 4 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 4 0 

2 41 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 3 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 41 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 3 0 0 

3 01 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3 01 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 

361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 54 0 

5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 6 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 6 0 0 

6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 66 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 66 0 

661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

7 81 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 81 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 4 0 

8 41 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 9 0 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

8 41 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 9 0 0 
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Qy 901 ACCCCCAAAGAAACCTTCTA 92 0 

I I I I I I I I I I I I I I I I I I I I 
Db 901 ACCCCCAAAGAAACCTTCTA 92 0 



AL133538 109238 bp DNA linear PRI 07-MAY-2008 

Human DNA sequence from clone RP11-550N16 on chromosome 6q26-27 
Contains a CpG island, complete sequence. 
AL133538 

AL133538.16 GI : 12666199 
HTG; CpG island. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 109238) 
AUTHORS Wallis,J. 
TITLE Direct Submission 

JOURNAL Submitted ( 3 O-APR-2 0 0 8 ) Wellcome Trust Sanger Institute, Hinxton, 
Cambridgeshire, CB10 ISA, UK. E-mail enquiries: vega@sanger.ac.uk 
Clone requests: Geneservice (http://www.geneservice.co.uk/) and 
BACPAC Resources (http://bacpac.chori.org/) 
COMMENT On Feb 5, 2001 this sequence version replaced gi: 12655232. 

Genome Center 

Center: Wellcome Trust Sanger Institute 
Center code: SC 

Web site: http://www.sanger.ac.uk 
Contact: vega@sanger.ac.uk 



This sequence was finished as follows unless otherwise noted: all 
regions were either double-stranded or sequenced with an alternate 
chemistry or covered by high quality data (i.e., phred quality >= 
30); an attempt was made to resolve all sequencing problems, such 
as compressions and repeats; all regions were covered by at least 
one subclone; and the assembly was confirmed by restriction digest, 
except on the rare occasion of the clone being a YAC . 
The following abbreviations are used to associate primary accession 
numbers given in the feature table with their source databases: 
Em:, EMBL; Sw:, SWISSPROT; Tr : , TREMBL; Wp : , WORMPEP; Information 
on the WORMPEP database can be found at 
http : //www . Sanger . ac . uk/Pro jects/C_elegans/wormpep 

This sequence was generated from part of bacterial clone contigs of 
human chromosome 6, constructed by the Sanger Centre Chromosome 6 
Mapping Group. Further information can be found at 
http : //www . Sanger . ac . uk/HGP/Chr6 

RP11-550N16 is from the library RPCI-11.2 constructed by the group 
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of Pieter de Jong. For further details see 
http : //bacpac . chori . org/ 
VECTOR: pBACe3 . 6 . 
FEATURES Location/Qualifiers 
source 1. .109238 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref="taxon: 9606" 
/ chromosome="6" 
/map="q2 6-2 7" 
/clone="RPll-55 0N16" 
/clone_lib="RPCI-ll . 2" 
misc_feature 109039 

/note="Clone_left_end: RP1-2 63021" 



Query Match 45.9%; Score 422.2; DB 5; Length 109238; 

Best Local Similarity 94.5%; Pred. No. 3.1e-119; 

Matches 460; Conservative 0; Mismatches 23; Indels 4; Gaps 2; 

Qy 25 7 CCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAAGATCCCAGTAGGGCAG 316 

I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 96122 CTCAAATGCTTTCTCATAGACCTCCATCAGGTGTCGACAAGGAAGATCCCAGTAGGGCAG 96063 

Qy 317 GAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTG 3 76 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 96062 GAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTG 96003 

Qy 3 77 GGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTA 43 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 96002 GGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTA 95943 

Qy 43 7 AGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAATTTCTCTGCTTCTGCA 496 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 95942 AGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAATTTCTCTGCTTCTGCA 95883 

Qy 49 7 AAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTAAAACCCTCCCTGCCCC 556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 958 82 AAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTAAAACCCTCCCTGCCCC 95 8 23 

Qy 55 7 AGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATCGAGAGACCTCTAACCC 616 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 95822 AGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATCGAGAGACCTCTAACCC 95 763 

Qy 617 TGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAACAACACAAGGGAAGTA 6 76 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 9 5 762 TGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAACAACACAAGGGAAGTA 95703 

Qy 6 77 CCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCCCAAGAGCCAGCCCCGA 73 6 

I I I I I I I I I III I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 95702 CCTGCTGG TTCTGGGGTTGGGGAGGAAGATCCCTACTG-CCCAAGAGCCAGCACAGA 95647 
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Qy 73 7 ACCCAAG 743 

I I I 

Db 95646 CACAAGG 95640 



RESULT 5 

Z79429 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



DNA linear PRI 23-AUG-1996 

TaqI fragment, SC6pA8B6 . 



Z79429 308 bp 

H. sapiens flow-sorted chromosome 
Z79429 

Z79429.1 GI:1508707 
Anonymous marker; single read. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 308) 

Mungall, A. J. , Huckle,E., Langford,C, Ross,M.T. and Rice, CM. 
Direct Submission 

Submitted (22-AUG-1996 ) The Sanger Centre, Wellcome Trust Genome 
Campus, Hinxton, Cambridgeshire, CB10 ISA, UK. E-mail contact: 
humquery@sanger .ac.uk 
Vector: pBSIISK+. 

Location /Qualifiers 

1. .308 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref="taxon: 9606" 
/chromosome="6 " 
/clone="SC6pA8B6" 
/sex="f emale" 

/tissue_type="EBV lymphoblastoid cell line" 

/clone_lib="SC6pA" 

/dev_stage=" adult" 

/note="The estimated purity of the flow-sorted chromosome 
6 library is >97%" 



Query Match 33.3%; 
Best Local Similarity 99.4%; 
Matches 306; Conservative 



Score 306; DB 5; 
Pred. No. 2.5e-83; 
0; Mismatches 2; 



Length 308; 



Indels 



0; Gaps 



0; 



2 93 ACAAGGAAGATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAA 352 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 ACAAGGAANATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAA 6 0 



3 53 TGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCT 412 
I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 61 TGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCT 12 0 



413 TTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCC 4 72 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 TTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCC 18 0 

4 73 CATGCCAATTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAA 53 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
181 CATGCCAATTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAA 2 4 0 

533 CGGGGGTAAAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAG 59 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 41 CGGGGGTAAAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAG 3 0 0 

5 93 GTAGAATC 6 00 

III I I I I 
301 GTANAATC 3 08 



AC067760 140243 bp DNA linear HTG 23-SEP-2000 

Homo sapiens chromosome 2 clone RP11-309K2 map 2, WORKING DRAFT 
SEQUENCE, 27 unordered pieces. 
AC067760 

AC067760.2 GI:10280923 
HTG; HTGS_PHASE1; HTGS_DRAFT . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 140243) 

AUTHORS Birren,B., Linton, L . , Nusbaum, C. and Lander, E. 

TITLE Homo sapiens chromosome 2, clone RP11-309K2 

JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 140243) 

AUTHORS Birren,B., Linton, L., Nusbaum, C, Lander, E., Abraham, H., Allen, N., 
Anderson, S., Baldwin, J., Barna,N., Bastien,V., Beda,F., 
Bogus lavkiy, L . , Boukhgalter , B . , Brown, A., Burkett,G., 
Campopiano, A. , Castle, A., Choepel,Y., Colangelo , M . , Collins, S., 
Collymore, A. , Cooke, P., DeArellano, K . , Dewar,K., Diaz, J. S., 
Dodge, S., Domino, M., Doyle, M. , Ferreira,P., FitzHugh,W., Gage,D., 
Galagan,J., Gardyna,S., Ginde,S., Goyette, M. , Graham, L., 
Grand-Pierre, N . , Grant, G., Hagos,B., Heaford,A., Horton,L., 
Howland, J . C . , Iliev,I., Johnson, R., Jones, C, Kann,L., Karatas,A., 
Klein, J., LaRocque,K., Lamazares , R . , Landers, T., Lehoczky,J., 
Levine,R., Lieu,C, Liu,G., Locke, K., Macdonald, P . , Marquis, N., 
McCarthy, M., McEwan,P., McGurk,A., McKernan,K., McPheeters , R . , 
Meldrim,J., Meneus,L., Mihova,T., Miranda, C, Mlenga,V., Morrow, J., 
Murphy, T., Naylor,J., Norman, C.H., O'Connor, T., 0 ' Donnell , P . , 
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0'Neil,D., Olivar, T .M. , Oliver, J., Peterson, K., Pierre, N., 
Pisani,C, Pollara,V., Raymond, C, Riley, R., Rogov,P., Rothman,D., 
Roy, A., Santos, R., Schauer,S., Severy,P., Spencer, B., 
Stange-Thomann, N . , Sto j anovic, N . , Subramanian, A . , Talamas,J., 
Tesfaye,S., Theodore, J . , Tirrell,A., Travers , M . , Trigilio,J., 
Vassiliev, H . , Viel,R., Vo,A., Wilson, B., Wu,X., Wyman,D., Ye,W.J., 
Young, G., Zainoun,J., Zimmer,A. and Zody,M. 
TITLE Direct Submission 

JOURNAL Submitted (27-APR-2000) Whitehead Institute/MIT Center for Genome 
Research, 320 Charles Street, Cambridge, MA 02141, USA 
OMMENT On Sep 23, 2000 this sequence version replaced gi: 7651809. 

All repeats were identified using RepeatMasker : 
Smit, A.F.A. & Green, P. (1996-1997) 

http : / / ftp . genome . Washington . edu/RM/RepeatMasker . html 
Genome Center 

Center: Whitehead Institute/ MIT Center for Genome Research 

Center code: WIBR 

Web site: http://www-seq.wi.mit.edu 

Contact : sequence_submissions@genome . wi .mit . edu 

Project Information 

Center project name: L9986 
Center clone name: 3 0 9_K_2 

Summary Statistics 

Sequencing vector: M13; M77815; 100% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads 
Assembly program: Phrap; version 0.960731 
Consensus quality: 128933 bases at least Q40 
Consensus quality: 134958 bases at least Q30 
Consensus quality: 136879 bases at least Q20 
Insert size: 148000; agarose-fp 
Insert size: 137643; sum-of -cont igs 
Quality coverage: 3.5 in Q20 bases; agarose-fp 
Quality coverage: 3.8 in Q20 bases; sum-of -cont igs 



NOTE: This is a 'working draft' sequence. It currently 
consists of 27 contigs. The true order of the pieces 
is not known and their order in this sequence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished sequence 
as soon as it is available and the accession number will 



be preserved. 














1 


4248 : 


contig 


of 4248 


bp 


in 


length 


4249 


4348 : 


gap of 


100 bp 








4349 


5605: 


contig 


of 1257 


bp 


in 


length 


5606 


5705: 


gap of 


100 bp 








5706 


7181 : 


contig 


of 1476 


bp 


in 


length 


7182 


7281: 


gap of 


100 bp 








7282 


9012: 


contig 


of 1731 


bp 


in 


length 


9013 


9112 : 


gap of 


100 bp 
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9113 


10509 


contig 


of 1397 


bp 


in 


length 


* 


10510 


10609 


gap of 


100 bp 










10610 


13120 


contig 


of 2511 


bp 


in 


length 


* 


13121 


13220 


gap of 


100 bp 










13221 


15323 


contig 


of 2103 


bp 


in 


length 




15324 


15423 


gap of 


100 bp 








* 


15424 


17407 


contig 


of 1984 


bp 


in 


length 




17408 


17507 


gap of 


100 bp 








* 


17508 


20878 


contig 


of 3371 


bp 


in 


length 




20879 


20978 


gap of 


100 bp 








* 


20979 


22666 


contig 


of 1688 


bp 


in 


length 




22667 


22766 


gap of 


100 bp 








* 


22767 


26227 


contig 


of 3461 


bp 


in 


length 




26228 


26327 


gap of 


10 0 bp 








* 


26328 


29552 


contig 


of 3225 


bp 


in 


length 




29553 


29652 


gap of 


100 bp 








* 


29653 


33144 


contig 


of 3492 


bp 


in 


length 




33145 


33244 


gap of 


10 0 bp 








* 


33245 


35959 


contig 


of 2715 


bp 


in 


length 


* 


35960 


36059 


gap of 


100 bp 








* 


36060 


40873 


contig 


of 4814 


bp 


in 


length 


* 


40874 


40973 


gap of 


100 bp 










40974 


46108 


contig 


of 5135 


bp 


in 


length 




46109 


46208 


gap of 


100 bp 








* 


46209 


50453 


contig 


of 4245 


bp 


in 


length 




50454 


50553 


gap of 


100 bp 








* 


50554 


54780 


contig 


of 4227 


bp 


in 


length 




54781 


54880 


gap of 


100 bp 








* 


54881 


60935 


contig 


of 6055 


bp 


in 


length 


* 


60936 


61035 


gap of 


10 0 bp 








* 


61036 


68604 


contig 


of 7569 


bp 


in 


length 


* 


68605 


68704 


gap of 


100 bp 








* 


68705 


75243 


contig 


of 6539 


bp 


in 


length 




75244 


75343 


gap of 


10 0 bp 








* 


75344 


81214 


contig 


of 5871 


bp 


in 


length 




81215 


81314 


gap of 


100 bp 










81315 


87663 


contig 


of 6349 


bp 


in 


length 




87664 


87763 


gap of 


100 bp 








* 


87764 


94389 


contig 


of 6626 


bp 


in 


length 




94390 


94489 


gap of 


100 bp 








* 


94490 


105834 


contig 


of 11345 bp 


ir 


length 




105835 


105934 


gap of 


100 bp 










105935 


125401 


contig 


of 19467 bp 


in length 




125402 


125501 


gap of 


100 bp 










125502 


140243 


contig 


of 14742 bp 


in length 



FEATURES Location/Qualifiers 
source 1. .140243 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref ="taxon : 9606" 
/chromosome="2 " 
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/map="2" 

/clone="RPll-3 0 9K2" 

/clone_lib="RPCI-ll Human Male BAC" 
misc_feature 1. .4248 

/ note=" as sembly_f ragment ; 

clone_end : SP6 ; 

vector_side : left" 
gap 4249. .4348 

/ estimated_length=10 0 
misc_feature 4349. .5605 

/ note="assembly_f ragment " 
gap 5606. .5705 

/ estimated_length=100 
misc_feature 5706. .7181 

/ note=" as sembly_f ragment " 
gap 7182. .7281 

/estimated_length=10 0 
misc_feature 7282. .9012 

/ note=" as sembly_f ragment " 
gap 9013. .9112 

/estimated_length=10 0 
misc_feature 9113. .10509 

/ no te=" as sembly_f ragment " 
gap 10510. .10609 

/estimated_length=100 
misc_feature 10610. .13120 

/ note=" as sembly_f ragment " 
gap 13121. .13220 

/estimated_length=10 0 
misc_feature 13221. .15323 

/ no te=" as sembly_f ragment " 
gap 15324. .15423 

/estimated_length=10 0 
misc_feature 15424. .17407 

/ note="assembly_f ragment " 
gap 17408. .17507 

/estimated_length=10 0 
misc_feature 17508. .20878 

/ no te=" as sembly_f ragment " 
gap 20879. .20978 

/estimated_length=10 0 
misc_feature 20979. .22666 

/ note=" as sembly_f ragment " 
gap 22667. .22766 

/estimated_length=10 0 
misc_feature 22767. .26227 

/ note=" as sembly_f ragment " 
gap 26228. .26327 

/estimated_length=10 0 
misc_feature 26328. .29552 
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misc_f eature 



misc_f eature 



misc_f eature 



misc_f eature 



/note="assembly_f r, 
29553. .29652 
/estimated_length= 
29653. .33144 
/note="assembly_f r 
33145. .33244 
/ estimated_length 
33245. .35959 
/ note="assembly_f r. 
35960. .36059 
/ estimated_length-- 
36060. .40873 
/ note="assembly_f r. 
40874. .40973 
/estimated_length^ 
40974. .46108 
/note="assembly_f r. 
46109. .46208 
/estimated_length: 



agment " 
100 

agment" 
100 

agment" 
100 

agment" 
100 

agment" 
100 



Query Match 29.9%; Score 275; DB 15; 

Best Local Similarity 100.0%; Pred. No. 2.7e-73; 
Matches 2 75; Conservative 0; Mismatches 0; 



Length 140243; 
Indels 0; Gaps 



0; 



Qy 



TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 5172 9 



Qy 

Db 

Qy 
Db 



CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 5166 9 

TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 516 0 9 



Qy 
Db 
Qy 
Db 



181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 4 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
516 0 8 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 515 4 9 

241 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAG 275 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
51548 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAG 51514 



AC015496 147659 bp DNA linear HTG 16-MAR-2000 

Homo sapiens clone RP11-21E5, WORKING DRAFT SEQUENCE, 7 unordered 
pieces . 
AC015496 
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AC015496.3 GI:7249030 
HTG; HTGS_PHASE1 ; HTGS_DRAFT . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 147659) 

AUTHORS Birren,B., Linton, L . , Nusbaum, C. and Lander, E. 

TITLE Homo sapiens, clone RP11-21E5 

JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 147659) 

AUTHORS Birren,B., Linton, L., Nusbaum, C, Lander, E., Allen, N., Anderson, M. 

Baldwin, J., Barna,N., Beckerly,R., Boguslavkiy , L . , Boukhgalter , B . , 
Brown, A., Castle, A., Colangelo, M . , Collins, S., Collymore, A. , 
Cooke, P., DeArellano, K . , Dewar,K., Domino, M., Donelan,L., Doyle, M. 
Ferreira,P., FitzHugh,W., Forrest, C, Funke,R., Gage,D., 
Galagan,J., Gardyna,S., Grant, G., Hagos,B., Heaford,A., Horton,L., 
Howland, J . C . , Johnson, R., Jones, C., Kann,L., Karatas,A., Klein, J., 
Lehoczky,J., Lieu,C, Locke, K., Macdonald, P . , Marquis , N . , 
McEwan,P., McGurk , A . , McKernan,K., McLaughlin, J . , Meldrim,J., 
Morrow, J., Naylor,J., Norman, C.H., O'Connor, T., 0 ' Donnell , P . , 
Peterson, K., Pollara,V., Riley,R., Roy, A., Santos, R., Severy,P., 
Stange-Thomann, N . , Sto j anovic, N . , Subramanian, A . , Talamas,J., 
Tesfaye,S., Tirrell,A., Vassiliev, H . , Vo,A., Wheeler, J., Wu,X., 
Wyman,D., Ye,W.J., Zimmer,A. and Zody,M. 

TITLE Direct Submission 

JOURNAL Submitted ( 16-NOV-1999 ) Whitehead Institute/MIT Center for Genome 
Research, 320 Charles Street, Cambridge, MA 02141, USA 
COMMENT On Mar 16, 2000 this sequence version replaced gi: 6553986. 

All repeats were identified using RepeatMasker : 
Smit, A.F.A. & Green, P. (1996-1997) 

http : //f tp . genome . Washington . edu/RM/RepeatMasker . html 
Genome Center 

Center: Whitehead Institute/ MIT Center for Genome Research 

Center code: WIBR 

Web site: http://www-seq.wi.mit.edu 

Contact : sequence_submissions@genome . wi .mit . edu 

Project Information 

Center project name: L3993 
Center clone name: 21_E_5 

Summary Statistics 

Sequencing vector: M13; M77815; 100% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads 
Assembly program: Phrap; version 0.960731 
Consensus quality: 129300 bases at least Q40 
Consensus quality: 140951 bases at least Q30 
Consensus quality: 145175 bases at least Q20 
Insert size: 146000; agarose-fp 
Insert size: 147059; sum-of -cont igs 
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Quality coverage: 5.8 in Q20 bases; agarose-fp 
Quality coverage: 5.7 in Q20 bases; sum-of -cont igs 



NOTE: This is a 'working draft' seguence . It currently 
consists of 7 contigs. The true order of the pieces 
is not known and their order in this seguence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished seguence 
as soon as it is available and the accession number will 
be preserved. 





1 


2851 : 


contig 


of 2851 bp in length 




2852 


2951: 


gap of 


100 bp 




2952 


6911: 


contig 


of 3960 bp in length 




6912 


7011 : 


gap of 


100 bp 




7012 


13475: 


contig 


of 6464 bp in length 




13476 


13575: 


gap of 


100 bp 




13576 


29402 : 


contig 


of 15827 bp in length 




29403 


29502: 


gap of 


100 bp 


* 


29503 


54805: 


contig 


of 25303 bp in length 




54806 


54905: 


gap of 


100 bp 




54906 


87698 : 


contig 


of 32793 bp in length 




87699 


87798 : 


gap of 


100 bp 




87799 


147659: 


contig 


of 59861 bp in length 



FEATURES Location/Qualifiers 
source 1. .147659 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref ="taxon : 96 06" 
/clone="RPll-21E5" 

/clone_lib="RPCI-ll Human Male BAC" 
misc_feature 1. .2851 

/note="assembly_f ragment " 
gap 2852. .2951 

/estimated_length=10 0 
misc_feature 2952. .6911 

/ no te=" as sembly_f ragment ; 

clone_end : T7 ; 

vector_side : right" 
gap 6912. .7011 

/estimated_length=10 0 
misc_feature 7012. .13475 

/ note=" as sembly_f ragment " 
gap 13476. .13575 

/estimated_length=10 0 
misc_feature 13576. .29402 

/ note=" as sembly_f ragment " 
gap 29403. .29502 

/ estimated_length=100 
misc_feature 29503. .54805 

/ note="assembly_f ragment " 
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gap 54806. .54905 

/estimated_length=10 0 
misc_feature 54906. .87698 

/note="assembly_f ragment " 
gap 87699. .87798 

/estimated_length=10 0 
misc_feature 87799. .147659 

/ note=" as sembly_f ragment ; 

clone_end : SP6 ; 

vector_side : right" 



Query Match 29.9%; Score 275; DB 15; Length 147659; 

Best Local Similarity 100.0%; Pred. No. 2.8e-73; 

Matches 2 75; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 41178 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 41119 

Qy 61 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 41118 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 41059 

Qy 121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 41058 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 40999 

Qy 181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 40998 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 40 93 9 

Qy 241 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAG 275 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 0 938 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAG 40904 



AL136100 172212 bp DNA linear PRI 06-MAY-2008 

Human DNA sequence from clone RP11-534P19 on chromosome 6 Contains 
a pseudogene similar to zinc finger proteins, complete sequence. 
AL136100 

AL136100.12 GI:11322762 
HTG; zinc finger. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
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REFERENCE 1 (bases 1 to 172212) 
AUTHORS Bird,C. 
TITLE Direct Submission 

JOURNAL Submitted ( 3 O-APR-2 0 0 8 ) Wellcome Trust Sanger Institute, Hinxton, 
Cambridgeshire, CB10 ISA, UK. E-mail enquiries: vega@sanger.ac.uk 
Clone requests: Geneservice (http://www.geneservice.co.uk/) and 
BACPAC Resources (http://bacpac.chori.org/) 
COMMENT On Nov 23, 2000 this sequence version replaced gi: 11139882. 

Genome Center 

Center: Wellcome Trust Sanger Institute 
Center code: SC 

Web site: http://www.sanger.ac.uk 
Contact: vega@sanger.ac.uk 



This sequence was finished as follows unless otherwise noted: all 
regions were either double-stranded or sequenced with an alternate 
chemistry or covered by high quality data (i.e., phred quality >= 
30); an attempt was made to resolve all sequencing problems, such 
as compressions and repeats; all regions were covered by at least 
one subclone; and the assembly was confirmed by restriction digest, 
except on the rare occasion of the clone being a YAC . 
The following abbreviations are used to associate primary accession 
numbers given in the feature table with their source databases: 
Em:, EMBL; Sw:, SWISSPROT; Tr : , TREMBL; Wp : , WORMPEP; Information 
on the WORMPEP database can be found at 
http : //www. sanger . ac . uk/Pro jects/C_elegans/wormpep 

This sequence was generated from part of bacterial clone contigs of 
human chromosome 6, constructed by the Sanger Centre Chromosome 6 
Mapping Group. Further information can be found at 
http : //www . sanger . ac . uk/HGP/Chr6 

RP11-534P19 is from the library RPCI-11.2 constructed by the group 
of Pieter de Jong. For further details see 
http : / /bacpac . chori . org/ 
VECTOR: pBACe3 . 6 . 
FEATURES Location/Qualifiers 
source 1. .172212 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref="taxon: 9606" 
/chromosome="6 " 
/clone="RPll-53 4P19" 
/clone_lib="RPCI-ll .2" 
misc_feature 101. .172212 

/note="annotated region of clone" 
gene complement ( 26 702 . .26930) 

/locus_tag="RPll-534P19 . 1-001" 
/pseudo 

CDS complement (26 702 . .26930) 

/locus_tag="RPll-53 4P19 .1-001" 

/note="match: proteins: Sw:Q96HQ0 Sw:Q96HQ0 Tr:O43340 
Tr : Q96DL8 Tr:Q9P220" 
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/pseudo 

/codon_start=l 
misc_feature 172112 



/note="Clone_right_end: RP11-53 4P19" 



ORIGIN 



Query Match 29.9%; Score 275; DB 5; Length 172212; 

Best Local Similarity 100.0%; Pred. No. 2.8e-73; 

Matches 2 75; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

y 1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 93324 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 93265 

y 61 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 93264 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 93205 

y 121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 93204 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 93145 

y 181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 93144 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 93085 

y 2 41 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAG 2 75 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 93084 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAG 93050 



DNA linear HTG 12-NOV-2005 

WORKING DRAFT SEQUENCE, 3 



AC169795 186029 bp 

Macaca mulatta clone CH250-283M17 
ordered pieces. 
AC169795 

AC169795.2 GI:82174563 

HTG; HTGS_PHASE2; HTGS_DRAFT; HTGS_FULLTOP . 
Macaca mulatta (rhesus monkey) 
Macaca mulatta 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Cercopithecidae; Cercopithecinae ; Macaca. 
1 (bases 1 to 186029) 

Muzny , D . M . , Abraham, K . K . , Abulimit i , A . , Adams, C.Q., Aduba,G., 
Allen, C.C., Alsbrooks , S . L . , Anosike, U . S . , Archer, P.M., 
Arredondo, H . H . , Attaway,T., Bandaranaike, D . P . , Bangura,L., 
Barton, S.R., Bell, A. V., Bell, S.N. , Beraducci , A . R . , Bickham,C, 
Biswalo,K., Blyth,P.R., Buhay,C.J., Canada, A., Cardenas, V., 
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TITLE 
JOURNAL 
REFERENCE 
AUTHORS 
TITLE 



Carter, K., Chacko,J., Chandrabose,M.N. , Chavez, A., Chavez, D., 
Chen,G., Chen,R., Chu,H., Clerc blankenburg, K . P . , Cockrell,R., 
Cooper, J. A., Coyle,M.D., Cree,A., Cueto,C.B., Curry, S.M., Dai,W., 
Dao , M . D . , Davila,M., Davis, C, Davy-Carroll , L . , Del fierro,P., 
Demen,R., Denson,S., Ding,Y., Dinh,H.H., Donlin,J.E., 
Dugan-Rocha, S . , Dunn, A.M., Durbin,K.J., Ebong,V.E., Egan,A., 
Espinosa, V . C . , Fa,M. , Fernandez , S . , Fernando, P . R . , Ferrer, A. R., 
Flagg,N., Forbes, L.D., Fowler, R.G., Fu,Q., Fuh, E . , Gabisi,R.A., 
Ganardhanan, M. , Ganer,J., Garcia iii,R.M., Garcia, A.M., 
Garcia, S.M., Garner, T.T., Ghose,S., Gingras,M., 

Gonzalez-Garay, M. L . , Guevara, W.V. , Haaland, W . C . , Haeberlen, K . A . , 

Hagans,B.J., Hall,0., Hamid,H., Hamilton, K . A . , Hampton, 0 . A . , 

Harbes,B.A., Harris, R. A., Havlak,P., Hawes,A.C, Hawkins , E . S . , 

Haynes,S.J., Hemphill, L., Hernandez , J . , Hines,S., Hirani,K., 

Hitchens,M.E. , Hodgson, A. V. , Hogues,M.E., Holder, M., Hollins,B., 

Howell, L.L., Hulyk,S.W., Hume, J., Jackson, A., Jackson, L . R . , 

Jacob, S.K., Jhangiani , S . N . , Jiang, H., Johnson, B., Johnson, R., 

Joshi,V., Joy,C, Kaikai,F.B., Kalaf us , K . J . , Kalu,J.B., Kang,Y., 

Keebler,J., Khan,Z.M., Kidwai,S., King,L.M., Kisamo,H., Kovar,C.L., 

Kowis,A.N., Kowis,C.R., Lago, L . A . , Lago, M . T . , Lai,C, Lara,F., 

Le , T . T . , Lee,S.L., Lee,T.W., Legall iii,F.H., Lemon, S.J. , 

Lewis, L.R., Li , B . , Li , Y . , Li,Z., Linnell , M . A . , Liu, J., Liu,W., 

Liu, Y . , Liu, Y . , Liyanage,D., London, P., Lopez, J., Lorensuhewa, L .M. , 

Lozado,R.J., Luc,T., Madu,R.C, Maheshwari, M. , Maheshwari , R . , 

Malloy,K., Mansouri, D.L. , Martinez,E., Mate jkova, P . , Mathew,T., 

Mccauley, S . K . , Mcpherson, J. D. , Mercado,C, Mercado, I . C . , 

Metzker, M. L . , Millin,A., Milosavl jevic, A. , Morgan, M.B., Morris, S., 

Munidasa,M., Murray, D.D., Muzny , D . M . , Nazareth, L . V . , Ngo,D.N., 

Nguyen, H.T., Nguyen, N.B., Nguyen, P. Q., Nwaokelemeh, 0 . 0 . , 

Obregon,M., Odeh,E.A., Okonkwo,F., Okwuonu, G . 0 . , Okwuonu, K . C . , 

Onyenekwe, J. , Parish, B. J., Parker, D.N. , Parra,A.A., Pasternak, S . , 

Patel,B.M., Patel,R.R., Paul, H. A., Perez, A., Perez, L.M., 

Perez, Y.Y., Pham,T.L., Player, E. J., Primus, E.L., Pu,L., Puazo,M., 

Purkiss,C, Qin,X., Quiroz,J.B., Rabata,D., Rachlin, E . K . , Ren,Y., 

Richards, S., Rojas,A., Ruiz,S., Sabo,A., Sant ibanez , J . , 

Savery,G.G., Scherer , S . E . , Schneider, B .W. , Sebasigari , R . , 

Sexton, M.M., Shen,H., Shen,Y., Sisson,I., Sneed,A.J., Sodergren, E . , 

Song,X., Sorelle, R.P. , Svatek,A.F., Taylor, E.W., Taylor, T.R., 

Thelus,R., Thomas, N., Thorn, R.D., Thornton, R . D . , Tong, M . Y . , 

Trejos,Z.Y., Usmani,K., Vargo,C.E., Vattathil , S . , Vega, R. A., 

Villasana, D . , Volkov,A., Walker, D.L. , Wang,Q., Wang,S., 

Warren, J. T., Watt, J. E., Wei,X., Wheeler , D . A . , White, C.S., Williams 

jr,R.L., Williams, A. C . , Williams , G . A . , Williams , J . D . , Wilson, K., 

Woodworth, J.R. , Worley,K.C, Wright, R. A., Wu,J., Wu,W., Yakub,S., 

Yerrapragada, S . , Yu,F., Yuan, D . T . , Yuan,Y., Zhang, J., Zhang, L., 

Zhang, Z., Zhou, J., Zhu,Y., Weinstock,G. and Gibbs,R.A. 

Direct Submission 

Unpublished 

2 (bases 1 to 186029) 

Worley,K.C. 

Direct Submission 
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JOURNAL Submitted ( 12-OCT-2 0 05 ) Human Genome Sequencing Center, Department 
of Molecular and Human Genetics, Baylor College of Medicine, One 
Baylor Plaza, Houston, TX 77030, USA 
REFERENCE 3 (bases 1 to 186029) 

AUTHORS Worley,K.C. 

TITLE Direct Submission 

JOURNAL Submitted ( 12-NOV-2 0 05 ) Human Genome Sequencing Center, Department 
of Molecular and Human Genetics, Baylor College of Medicine, One 
Baylor Plaza, Houston, TX 77030, USA 
COMMENT On Nov 12, 2005 this sequence version replaced gi : 77539482. 

Genome Center 

Center: Baylor College of Medicine 
Center code: BCM 

Web site: http://www.hgsc.bcm.tmc.edu/ 
Contact: hgsc-help.tmc.edu 

Project Information 

Center project name: LDVL 
Center clone name: CH250-283M17 

Summary Statistics 

Sequencing vector: Plasmid; 
Chemistry: Dye-terminator Big Dye: 
Assembly program: Phrap; version 0. 
Consensus quality: 186011 bases at least Q40 
Consensus quality: 186345 bases at least Q30 
Consensus quality: 186634 bases at least Q20 
Estimated insert size: 205029; sum-of -cont igs estimation 
Quality coverage: 8x in Q20 bases; sum-of -cont igs estimation 



100% of reads 
.990329 



NOTE: Estimated insert size may differ from sequence length 

(see http : / /www . hgsc . bcm. tmc . edu/docs/Genbank_draf t_data . html ) 
The sequence data in this record represents an 'enhanced' version 
of a Phase 2 submission. The indicated order and orientation of 
each sequence has been established using one or more of the 
following: read-pair data from individual subclones, overlaps 
with neighboring clones, alignment with available reference 
sequence (e.g., human), and/or confirmation by PCR testing. 
NOTE: This is a 'working draft' sequence. It currently 
consists of 3 contigs. Gaps between the contigs 
are represented as runs of N. The order of the pieces 
is believed to be correct as given, however the sizes 
of the gaps between them are based on estimates that have 
provided by the submittor. 
This sequence will be replaced 

by the finished sequence as soon as it is available and 
the accession number will be preserved. 

1 54276: contig of 54276 bp in length 
54277 54376: gap of 100 bp 

54377 104140: contig of 49764 bp in length 
104141 104340: gap of 200 bp 

104341 186029: contig of 81689 bp in length. 
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FEATURES 

source 



misc_f eature 
misc_f eature 
gap 

misc_f eature 
misc_f eature 
gap 

misc_f eature 



Location /Qualifiers 
1. .186029 

/organism="Macaca mulatta" 
/mol_type="genomic DNA" 
/db_xref ="taxon: 9544" 
/clone="CH2 5 0-2 83M17" 
1. .54276 

/ note="assembly_name : Contig3 5" 

54277. .54376 

/ note="assembly_name : gap" 

54277. .54376 

/ estimated_length=100 

54377. .104140 

/note="assembly_name : Contig3 9 " 
104141. .104340 
/note="assembly_name : gap" 
104141. .104340 
/estimated_length=2 0 0 
104341. .186029 

/note="assembly_name : Contig4 0 " 



Query Match 24.7%; Score 226.8; DB 16; Length 186029; 

Best Local Similarity 91.3%; Pred. No. 3.1e-58; 

Matches 252; Conservative 0; Mismatches 22; Indels 2; Gaps 



1; 



Qy 1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 60 

III I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1523 52 TCTATAGAGGAGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCGGTGGAGAGGTG 152 411 



Qy 

Db 

Qy 

Db 



6 1 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 
I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

152 412 CACTTGGTGAGAAAACTATGGCTCTGCCAACCACCTACGCTAACCTGCTAGGCCTGAGAC 152 4 71 

121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCA — GTCAAGCCCTTAGCTGGCTGCAGCC 17 8 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

152 4 72 TGAGCCACTTTGGAAGCTGATCTTGGAGCACTAAGTTCAAGCCTTCAGCTGACTGCAGCC 152 531 



Qy 179 ACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAA 23 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 152532 CCAGCCAACAATAAGACTGCAACCTCCTGGGGAATCCTGAGCCAGAATCCCCTGGCTAAA 152591 

Qy 239 TTGCTCCTTGATTCTTAACCCACAGAAATTGTGTAA 274 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 152592 TTGATCCTTCATTCTTAACCCACCGAAATTGTGTGA 15262 7 



RESULT 10 
CS074841/C 

LOCUS CS074841 390 bp DNA linear PAT 05-MAY-2005 

DEFINITION Seguence 267 from Patent WO2005030250 . 
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ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



FEATURES 

source 



CS074841 

CS074841.1 GI:63091381 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 

Tuereci,0., Sahin,U., Helf tenbein, G . and Schlueter,V. 
Identification of tumour-associated cell surface antigens for 
diagnosis and therapy 

Patent: WO 2005030250-A 267 07-APR-2005; 
Ganymed Pharmaceuticals AG (DE) 
Location /Qualifiers 
1. .390 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 



Query Match 19.4%; 
Best Local Similarity 93.5%; 
Matches 186; Conservative 



Score 17 8.2; DB 2; 
Pred. No. 2e-43; 
0; Mismatches 13; 



Length 390; 



Indels 



Gaps 



Qy 



32 8 ACCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGA 38 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 6 4 ATCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGA 2 05 



Qy 

Db 

Qy 

Db 

Qy 
Db 



3 8 8 AGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCC 44 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
204 AGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCC 145 

448 AGCTAAACCTCCAACCTCCAGATCCCATGCCAATTTCTCTGCTTCTGCAAAAGGACTTCA 50 7 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
144 AGCTAAACCTCCAACCTCCAGATCCCATGCCAATTTCTCTGCTTCTGCAAAAGGACTCAT 85 

50 8 AGTGAAAGACATCTGCAGC 52 6 
I I I III I I I I 
8 4 GGGCAGCGTTATCCACAGC 6 6 



DJ044524 390 bp DNA linear PAT 26-OCT-2007 

Identification of Tumour-Associated Cell Surface Antigens for 
Diagnosis and Therapy. 
DJ044524 

DJ044524.1 GI:158738196 
JP 2007506417-A/200 . 
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SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 390) 

AUTHORS Herufutembain, G. , Shuryuta,V., Sahin,U. and Tureci,0. 

TITLE Identification of Tumour-Associated Cell Surface Antigens for 

Diagnosis and Therapy 
JOURNAL Patent: JP 2007506417-A 200 22-MAR-2007; 
Ganymed Pharmaceuticals AG 
COMMENT OS Homo sapiens 

JP 2007506417-A/200 

22- MAR-2007 

23- SEP-2004 JP 2006527354 
26-SEP-2003 DE 10344799.7 

geert herufutembain, volker shuryuta, ugur sahin, oezlem tureci 



Key Location/Qualifiers. 
FEATURES Location/Qualifiers 
source 1. .390 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref ="taxon : 9606" 



Query Match 19.4% 
Best Local Similarity 93.5% 
Matches 186; Conservative 



Score 178.2; DB 2; 
Pred. No. 2e-43; 
0; Mismatches 13; 



Length 390; 

Indels 0; Gaps 



Qy 



3 2 8 ACCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGA 3 8 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
26 4 ATCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGA 205 



Qy 



3 8 8 AGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCC 44 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
204 AGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCC 145 



Qy 
Db 
Qy 
Db 



448 AGCTAAACCTCCAACCTCCAGATCCCATGCCAATTTCTCTGCTTCTGCAAAAGGACTTCA 50 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
144 AGCTAAACCTCCAACCTCCAGATCCCATGCCAATTTCTCTGCTTCTGCAAAAGGACTCAT 85 

50 8 AGTGAAAGACATCTGCAGC 526 
I I I III I I I I 
8 4 GGGCAGCGTTATCCACAGC 6 6 



RESULT 12 
CS430486/C 

LOCUS CS430486 390 bp DNA linear PAT 19-OCT-2006 

DEFINITION Seguence 267 from Patent WO2006100089 . 
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ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



FEATURES 

source 



CS430486 

CS430486.1 GI:116291800 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorclata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 

Sahin,U., Tuereci,0., Koslowski, M. , Helf tenbein, G . , Usener,D. and 
Schlueter, V. 

Identification of surface-associated antigens for tumor diagnosis 
and therapy 

Patent: WO 2006100089-A 267 28-SEP-2006; 
Ganymed Pharmaceuticals AG (DE) 
Location /Qualifiers 
1. .390 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 9606" 



Query Match 19.2 
Best Local Similarity 93.0 
Matches 185; Conservative 



Score 176.6; DB 2; 
Pred. No. 6.3e-43; 
0; Mismatches 14; 



Length 390; 



Indels 



Gaps 



Qy 



32 8 ACCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGA 38 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 6 4 ATCTCTGCTGTGGCCAATGCAGGAATGCTGGCCATCATTGCTTCTGCTGGGCGACTGAGA 2 0 5 



Qy 
Db 
Qy 
Db 

Qy 
Db 



3 8 8 AGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCC 44 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
204 AGCATCACCCACTTCCCCAGAACCTTTTTTACGTGGAGTGAAAACTTTAAGGGGCTGTCC 145 

448 AGCTAAACCTCCAACCTCCAGATCCCATGCCAATTTCTCTGCTTCTGCAAAAGGACTTCA 50 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
144 AGCTAAACCTCCAACCTCCAGATWCCATGCCAATTTCTCTGCTTCTGCAAAAGGACTCAT 85 

50 8 AGTGAAAGACATCTGCAGC 52 6 
I I I III I I I I 
8 4 GGGCAGCGTTATCCACAGC 66 



AR612609 561 bp DNA linear PAT 15-DEC-2004 

Seguence 108 from patent US 6828097. 

AR612609 

AR612609.1 GI:56668431 
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SOURCE Unknown. 

ORGANISM Unknown. 

Unclassified . 
REFERENCE 1 (bases 1 to 561) 

AUTHORS Knoll, J.H.M. and Rogan,P.K. 

TITLE Single copy genomic hybridization probes and method of generating 

same 

JOURNAL Patent: US 682809 7-A 108 0 7-DEC-2 0 0 4 ; 

The Childrens Mercy Hospital; Kansas City, MO 
FEATURES Location/Qualifiers 
source 1. .561 

/ organism= "unknown" 
/mol_type="genomic DNA" 



Query Match 13.3%; 
Best Local Similarity 69.6%; 
Matches 201; Conservative 



Score 12 2.6; DB 2; 
Pred. No. 4.7e-26; 
0; Mismatches 74; 



Length 561; 



Indels 14; Gaps 



Qy 

Db 



2 CTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGC 6 1 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 01 CTCTGGGGGAAGCCAGCTGCCATGTCATGAGGACACTCAAGCAGCCCTGTGGAGAGGCCC 26 0 



Qy 

Db 

Qy 

Db 

Qy 
Db 



6 2 ACTTGGTGAGAAACCGATGCCT-CTGCCAACCACCTGCACTAACCTGCTGGGTC 114 

I III I I I I I I I I I I I I I II I I I I I I I I I I I I I I II 

261 ATGTGGCAAGGAACTGAGGCCTCCTGCCAACAGCCAGCAAGGAACTGAGGCCTCCTGCCA 32 0 

115 TGAGACTGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGC 16 7 

II II I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I II 
3 21 ACAGCCATGTGAGTGAGCCATCTTGGAAGCAGATCCTCCAGCCCCAGTCAAGCCTTCAGA 3 8 0 

16 8 TGGCTGCAGCCACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATC 22 7 

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I II I I I I I I I I I I I I I 

381 TGACTGCAGCCCCAGCTAACATCTTGACTGCAACCTCATGAGAGACCCTGAGCCAGAACC 44 0 



228 CCCTGGCTAAATTGCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGA 276 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
441 ACCCAGCTAAGCTGCTCCTAAATTCCTGACCCACAGAAACTGTGAGAGA 48 9 



AR799123 561 bp DNA 

Sequence 108 from patent US 7014997. 
AR799123 

AR799123.1 GI:91151193 



PAT 05-APR-2006 



Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 561) 
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AUTHORS Knoll, J.H.M. and Rogan,P.K. 

TITLE Chromosome structural abnormality localization with single copy 

probes 

JOURNAL Patent: US 70149 9 7-A 108 21-MAR-2 0 06 ; 

The Children's Mercy Hospital; Kansas City, MO; 
US; 

FEATURES Location/Qualifiers 
source 1. .561 

/ organism= "unknown" 
/mol_type="genomic DNA" 

ORIGIN 



Query Match 13.3%; 
Best Local Similarity 69.6%; 
Matches 201; Conservative 



Score 12 2.6; DB 2; 
Pred. No. 4.7e-26; 
0; Mismatches 74; 



Length 561; 



Indels 



14; 



Gaps 



Qy 



2 CTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGC 6 1 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 01 CTCTGGGGGAAGCCAGCTGCCATGTCATGAGGACACTCAAGCAGCCCTGTGGAGAGGCCC 26 0 



Qy 

Db 

Qy 

Db 

Qy 
Db 
Qy 

Db 



6 2 ACTTGGTGAGAAACCGATGCCT-CTGCCAACCACCTGCACTAACCTGCTGGGTC 114 

I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

261 ATGTGGCAAGGAACTGAGGCCTCCTGCCAACAGCCAGCAAGGAACTGAGGCCTCCTGCCA 32 0 

115 TGAGACTGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGC 16 7 

II II I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I II 
321 ACAGCCATGTGAGTGAGCCATCTTGGAAGCAGATCCTCCAGCCCCAGTCAAGCCTTCAGA 38 0 

16 8 TGGCTGCAGCCACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATC 22 7 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I 

381 TGACTGCAGCCCCAGCTAACATCTTGACTGCAACCTCATGAGAGACCCTGAGCCAGAACC 44 0 

228 CCCTGGCTAAATTGCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGA 276 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
441 ACCCAGCTAAGCTGCTCCTAAATTCCTGACCCACAGAAACTGTGAGAGA 48 9 



DD052947 561 bp DNA linear PAT 04-NOV-2005 

SINGLE COPY GENOMIC HYBRIDIZATION PROBES AND METHOD OF GENERATING 
SAME. 
DD052947 

DD052947.1 GI:92688538 
JP 2004523201-A/108 . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
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REFERENCE 1 (bases 1 to 561) 

AUTHORS Joan,K.H.M. , Rogan,P.K. and Kazarro, P .M. 

TITLE SINGLE COPY GENOMIC HYBRIDIZATION PROBES AND METHOD OF GENERATING 

JOURNAL Patent: JP 2004523201-A 108 05-AUG-2004; 

KNOLL H JOAN, PETER K ROGAN 
COMMENT OS Homo sapiens 

PN JP 2004523201-A/108 

PD 05-AUG-2004 

PF 15-MAY-2001 JP 2001585298 

PR 16-MAY-2000 US 09/573080, 14-MAY-2001 US 09/854867 PI 
knoll h m joan, peter k rogan, patricia m kazarro CC 
FH Key Location/Qualifiers 
(1). .(561) 
/note='mltlf 1 ' 
(62). .(62) 



FT 
FT 
FT 
FT 
FT 
FT 



repeat_region 



misc_f eature 



misc_f eature 



FEATURES 

source 



/note='n is a, 
(165). .(165) 
/note='n is a, 
Location /Qualifiers 
1. .561 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref ="taxon : 9606" 



g or t ' 



Query Match 13.3%; 
Best Local Similarity 69.6%; 
Matches 201; Conservative 



Score 122.6; DB 2; 
Pred. No. 4.7e-26; 
0; Mismatches 74; 



Length 561; 
Indels 14; Gaps 



Qy 



2 CTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGC 6 1 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 01 CTCTGGGGGAAGCCAGCTGCCATGTCATGAGGACACTCAAGCAGCCCTGTGGAGAGGCCC 26 0 



6 2 ACTTGGTGAGAAACCGATGCCT-CTGCCAACCACCTGCACTAACCTGCTGGGTC 114 

I III I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I II 
2 61 ATGTGGCAAGGAACTGAGGCCTCCTGCCAACAGCCAGCAAGGAACTGAGGCCTCCTGCCA 32 0 



QY 
Db 

Qy 
Db 

Qy 
Db 



115 TGAGACTGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGC 16 7 

II II I I I I I I I I I I I 1 I I I I I I I I III I I I I I I I I I I I I II 
3 21 ACAGCCATGTGAGTGAGCCATCTTGGAAGCAGATCCTCCAGCCCCAGTCAAGCCTTCAGA 3 8 0 

16 8 TGGCTGCAGCCACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATC 22 7 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I 
3 81 TGACTGCAGCCCCAGCTAACATCTTGACTGCAACCTCATGAGAGACCCTGAGCCAGAACC 44 0 

22 8 CCCTGGCTAAATTGCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGA 2 76 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
441 ACCCAGCTAAGCTGCTCCTAAATTCCTGACCCACAGAAACTGTGAGAGA 48 9 
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